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Abstract 

A cycle cover of a graph is a set of cycles such that every vertex is part of exactly 
one cycle. An L-cycle cover is a cycle cover in which the length of every cycle is in 
the set LCN. 

We investigate how well L-cycle covers of minimum weight can be approximated. 
For undirected graphs, we devise a polynomial-time approximation algorithm that 
achieves a constant approximation ratio for all sets L. On the other hand, we prove 
that the problem cannot be approximated within a factor of 2 — e for certain sets L. 

For directed graphs, we present a polynomial-time approximation algorithm that 
achieves an approximation ratio of 0{n), where n is the number of vertices. This is 
asymptotically optimal: We show that the problem cannot be approximated within a 
factor of o(n). 

To contrast the results for cycle covers of minimum weight, we show that the 
problem of computing L-cycle covers of maximum weight can, at least in principle, be 
approximated arbitrarily well. 

Keywords: Combinatorial Optimization, Approximation Algorithms, Graph Algo- 
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1 Introduction 

A cycle cover of a graph is a spanning subgraph that consists solely of cycles such that 
every vertex is part of exactly one cycle. Cycle covers are an important tool for the design 
of approximation algorithms for different variants of the traveling salesman problem [3,5,6, 
9-12,21], for the shortest common superstring problem from computational biology [8,29], 
and for vehicle routing problems [18]. 

In contrast to Hamiltonian cycles, which are special cases of cycle covers, cycle covers 
of minimum weight can be computed efficiently. This is exploited in the above mentioned 
algorithms, which in general start by computing a cycle cover and then join cycles to 
obtain a Hamiltonian cycle (this technique is called subtour patching [14]). 
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Short cycles limit the approximation ratios achieved by such algorithms. Roughly 
speaking, the longer the cycles in the initial cover, the better the approximation ratio. 
Thus, we are interested in computing cycle covers without short cycles. Moreover, there 
are algorithms that perform particularly well if the cycle covers computed do not contain 
cycles of odd length [5]. Finally, some vehicle routing problems [18] require covering 
vertices with cycles of bounded length. 

Therefore, we consider restricted cycle covers, where cycles of certain lengths are ruled 
out a priori: For a set L C N, an L-cycle cover is a cycle cover in which the length of each 
cycle is in L. 

Unfortunately, computing L-cycle covers is hard for almost all sets L [20,24,25]. Thus, 
in order to fathom the possibility of designing approximation algorithms based on com- 
puting cycle covers, our aim is to find out how well L-cycle covers can be approximated. 

Beyond being a basic tool for approximation algorithms, cycle covers arc interesting in 
their own right. Matching theory and graph factorization are important topics in graph 
theory. The classical matching problem is the problem of finding one- factors, i. e., spanning 
subgraphs in which every vertex is incident to exactly one edge. Cycle covers of undirected 
graphs are also called two-factors since every vertex is incident to exactly two edges in 
a cycle cover. Both structural properties of graph factors and the complexity of finding 
graph factors have been the topic of a considerable amount of research (cf. Lovasz and 
Plummer [22] and Schrijver [28]). 

1.1 Preliminciries 

Let G = (y, E) be a graph with vertex set V and edge set E. If G is undirected, then 
a cycle cover of G is a subset G Q E of the edges of G such that all vertices in V are 
incident to exactly two edges in G. If G is a directed graph, then a cycle cover of G is a 
subset G C. E such that all vertices are incident to exactly one incoming and one outgoing 
edge in G. Thus, the graph {V, G) consists solely of vertex-disjoint cycles. The length of 
a cycle is the number of edges it consists of. We are concerned with simple graphs, i. e., 
the graphs do not contain multiple edges or loops. Thus, the shortest cycles of undirected 
and directed graphs are of length three and two, respectively. We call a cycle of length A 
a A-cycle for short. 

An L-cycle cover of an undirected graph is a cycle cover in which the length of 
every cycle is in the set L C = {3, 4, 5, . . .}. An L-cycle cover of a directed graph is 
analogously defined except that L C X) = {2, 3, 4, . . .}. A special case of L-cycle covers 
are fc-cycle covers, which are {k,k + 1, . . .}-cycle covers. Let L = U \ L in the case of 
undirected graphs, and let L = P \ L in the case of directed graphs (whether we consider 
undirected or directed cycle covers will be clear from the context). 

Given edge weights w : E ^ N, the weight w{C) of a subset G C E of the edges of 
G is w{G) = X^eGC^(^)- particular, this defines the weight of a cycle cover since we 
view cycle covers as sets of edges. 

Min-L-UCC is the following optimization problem: Given an undirected complete 
graph with non-negative edge weights that satisfy the triangle inequality (w{{u,v}) < 
'w{{u,x}) + w{{x,v}) for all u,x,v G V) find an L-cycle cover of minimum weight. Min- 
fc-UCC is defined for k ^ U like Min-L-UCC except that A;-cycle covers rather than 
L-cycle covers are sought. The triangle inequality is not only a natural restriction, it is 
also necessary: If finding L-cycle covers in graphs is NP-hard, then Min-L-UCC without 
the triangle inequality does not allow for any approximation at all. 

Min-L-DCC and Min-fc-DCC are defined for directed graphs like Min-L-UCC and 
Min-Z^-UCC for undirected graphs except that L C.V and k € V and the triangle inequality 
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is of the form ■w{u, v) < w{u, x) + w{x, v). 

Finally, Max-L-UCC, Max-fc-UCC, Max-L-DCC, and Max-fc-DCC are analo- 
gously defined except that cycle covers of maximum weight are sought and that the edge 
weights do not have to fulfill the triangle inequality. 

1.2 Previous Results 

Undirected Cycle Covers. Min-^-UCC, i. e., the undirected cycle cover problem with- 
out any restrictions, can be solved in polynomial time via Tutte's reduction to the classical 
perfect matching problem [22]. By a modification of an algorithm of Hartvigsen [17], also 
4-cycle covers of minimum weight in graphs with edge weights one and two can be com- 
puted efficiently. For Min-fc-UCC restricted to graphs with edge weights one and two, 
there exists a factor 7/6 approximation algorithm for all k [7]. Hassin and Rubinstein [19] 
presented a randomized approximation algorithm for Max-{3}-UCC that achieves an ap- 
proximation ratio of 83/43-1- e. Max-L-UCC admits a factor 2 approximation algorithm 
for arbitrary sets L [23,25]. Gocmans and Williamson [15] showed that Min-Zc-UCC and 
Min-{A;}-UCC can be approximated within a factor of 4. Min-L-UCC is NP-hard and APX- 
hard if L 2 {3}, i. e., for all but a finite number of sets L [20,24,25,30]. This means that 
for almost all L, these problems are unlikely to possess polynomial-time approximation 
schemes (PTAS, sec Ausiello ct al. [2] for a definition). 

If Min-L-UCC is NP-hard, then the triangle inequality is necessary for efficient ap- 
proximations of this problem; without the triangle inequality, Min-L-UCC cannot be ap- 
proximated at all. 

Directed Cycle Covers. Min-'D-DCC, which is also known as the assignm,ent problem^ 
can be solved in polynomial time by a reduction to the minimum weight perfect matching 
problem in bipartite graphs [1]. The only other L for which Min-L-DCC can be solved 
in polynomial time is L = {2}. For ah L C I? with L ^ {2} and L ^V, Min-L-DCC 
and Max-L-DCC are APX-hard and NP-hard, even if only two different edge weights are 
allowed [24,25]. 

There is a 4/3 approximation algorithm for Max-3-DCC [6] as well as for Min-/s-DCC 
for A; > 3 with the restriction that the only edge weights allowed are one and two [4]. 
Max-L-DCC can be approximated within a factor of 8/3 for all L [25]. 

Analogously to Min-L-UCC, Min-L-DCC cannot be approximated at all without the 
triangle inequality. 

1.3 New Results 

While L-cycle covers of maximum weight allow for constant factor approximations, only 
little is known so far about the approximability of computing L-cycle covers of minimum 
weight. Our aim is to close this gap. 

We present an approximation algorithm for Min-L-UCC that works for all sets L C.U 
and achieves a constant approximation ratio (Section 2.1). Its running-time is 0{n? logn). 
On the other hand, we show that the problem cannot be approximated within a factor of 
2 — e for general L (Section 2.2). 

Our approximation algorithm for Min-L-DCC achieves a ratio of 0(n), where n is the 
number of vertices (Section 3.1). This is asymptotically optimal: There exists sets L for 
which no algorithm can approximate Min-L-DCC within a factor of o(n) (Section 3.2). 
Furthermore, we argue that Min-L-DCC is harder to approximate than the other three 
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variants even for more "natural" sets L than the sets used to show the inapproximabihty 
(Section 3.3). 

Finally, to contrast our results for Min-L-UCC and Min-L-DCC, we show that Max- 
L-UCC and Max-L-DCC can be approximated arbitrarily well at least in principle (Sec- 
tion 4). 

2 Approximability of Min-L-UCC 

2.1 An Approximation Algorithm for Min-X-UCC 

The aim of this section is to devise an approximation algorithm for Min-L-UCC that works 
for all sets L CU. The catch is that for most L it is impossible to decide whether some 
cycle length is in L since there arc uncountably many sets L: If, for instance, L is not a 
recursive set, then deciding whether a cycle cover is an L-cycle cover is impossible. One 
option would be to restrict ourselves to sets L such that the unary language {1'^ | A € L} is 
in P. For such L, Min-L-UCC and Min-L-DCC are NP optimization problems (see Ausiello 
et al. [2] for a definition). Another possibility for circumventing the problem would be to 
include the permitted cycle lengths in the input. While such restrictions are mandatory 
if we want to compute optimum solutions, they are not needed for our approximation 
algorithms. 

A complete n-vcrtcx graph contains an L-cycle cover as a spanning subgraph if and 
only if there exist (not necessarily distinct) lengths Ai, . . . , G L for some /c € N with 
Z^i=i = ^- We call such an n L-admissible and define (L) = {n | n is L-admissible}. 
Although L can be arbitrarily complicated, (L) always allows efficient membership testing 
according to the following lemma. 

Lemma 2.1 (Manthey [25, Lem. 3.1]). For all L C N, there exists a finite set L' C L 
with (L') = (L). 

Let ql be the greatest common divisor of all numbers in L. Then (L) is a subset of the 
set of natural numbers divisible by gL. The proof of Lemma 2.1 shows that there exists 
a minimum p/, € N such that rjgL € (L) for all > pL- The number pi^ is the Probenius 
number [27] of the set {A | G L}, which is L scaled down by gi- For instance, if 
L = {8, 10}, then gi = 2 and pi = 11 since the Frobenius number of {4, 5} is 11. 

In the following, it suffices to know such a finite set L' C L. The L-cycle covers 
computed by our algorithm will in fact be L'-cycle covers. In order to estimate the 
approximation ratio, this cycle cover will be compared to an optimal (L')-cycle cover. 
Since L' C L C (L'), every L'- or L-cycle cover is also a (L')-cycle cover. Thus, the weight 
of an optimal (L')-cycle cover provides a lower bound for the weight of both an optimal 
L'- and an optimal L-cycle cover. For simplicity, we do not mention L' in the following. 
Instead, we assume that already L is a finite set, and we compare the weight of the L-cycle 
cover computed to the weight of an optimal (L)-cycle cover to bound the approximation 
ratio. 

Goemans and Williamson have presented a technique for approximating constrained 
forest problems [15], which we will exploit. Let G = (V, E) be an undirected graph, and let 
w : E ^ N he non-negative edge weights. Let 2^ denote the power set of V. A function 
/ : 2^ ^ {0, 1} is called a proper function if it satisfies 

• f{S) = f{V \ S) for allSCV (symmetry), 

• if A and B are disjoint, then f{A) = f{B) = implies f{A U B) = (disjointness), 
and 
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• f{V) = 0. 



The aim is to find a set F of edges such that there is an edge connecting StoV\SioT all 
S C.V with f{S) = 1. (The name "constrained forest problems" comes from the fact that 
it suffices to consider forests as solutions; cycles only increase the weight of a solution.) 
For instance, the minimum spanning tree problem corresponds to the proper function / 
with f{S) = 1 for ah S with $CSgV. 

Goemans and Williamson have presented an approximation algorithm [15, Fig. 1] for 
constrained forest problems that are characterized by proper functions. We will refer to 
their algorithm as GoeWill. 

Theorem 2.2 (Goemans and Williamson [15, Thm. 2.4]). Let i be the number of vertices 
V with /({f}) = 1. Then GoeWill is a (2 — |)-approximation for the constrained forest 
problem defined by a proper function /. 

In particular, the function fi given by 

f (q\ - j ^ if 151 # (mod gi) and 
m^) - I if |5| = (mod ql) 

is proper if \V\ = n is divisible by gi- (If n is not divisible by g^, then G does not contain 
an L-cyclc cover at all.) Given this function, a solution is a forest H = {V,F) such that 
the size of every connected component of is a multiple of gL. In particular, if g^ = 1, 
then fL{S) = for all S, and an optimum solution are n isolated vertices. 

If the size of all components of the solution obtained are in (L), we are done: By 
duplicating all edges, we obtain Eulerian components. Then we construct an (L)-cycle 
cover by traversing the Eulerian components and taking shortcuts whenever we come to a 
vertex that we have already visited. Finally, we divide each A-cycle into paths of lengths 
Ai — 1, . . . , Afc — 1 for some k such that Ai + . . . + A^ = A and Xi £ L for all ?'. By 
connecting the respective endpoints of each path, we obtain cycles of lengths Ai, . . . , A^. 
We perform this for all components to get an L-cycle cover. A straightforward analysis 
yields an approximation ratio of 8. A more careful analysis shows that the actual ratio 
achieved is 4. The details for the special case of L = {k} are spelled out by Goemans and 
Williamson [15]. 

However, this procedure does not work for general sets L since the sizes of some 
components may not be in (L). This can happen if pL > (for L = {k}, for which the 
algorithm works, we have = 0). At the end of this section, we argue why it seems to 
be difficult to generalize the approach of Goemans and Williamson in order to obtain an 
approximation algorithm for Min-L-UCC whose approximation ratio is independent of L. 

In the following, our aim is to add edges to the forest H = {V, E) output by GoeWill 
such that the size of each component is in (L) . This will lead to an approximation algo- 
rithm for Min-L-UCC with a ratio of 4 • [pL + 4), which is constant for each L. Let F* 
denote the set of edges of a minimum- weight forest such that the size of each component is 
in (L). The set F* is a solution to G, and /l, but not necessarily an optimum solution. 

By Theorem 2.2, we have w{F) < 2 • w{F*) since w{F*) is at least the weight of an 
optimum solution to G, w, and f^. Let C = (y',F') be any connected component of 
F with \V'\ ^ (L). The optimum solution F* must contain an edge that connects V' 
to V \V'. The weight of this edge is at least the weight of the minimum- weight edge 
connecting V' to V \ V' . 

We will add edges until the sizes of all components is in (L). Our algorithm acts in 
phases as follows: Let H = (V, F) be the graph at the beginning of the current phase. 
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and let Ci, . . . ,Ca be its connected components, where Vi is the vertex set of Cj. We 
win construct a new graph H = (y,F) with F ^ F. Let Ci, . . . ,Ch be the connected 
components with \Vi\ ^ {L). We call these components illegal. For i £ {1, ... ,5}, let 
be the cheapest edge connecting Vi to V \ Vi. (Note that = ej for i ^ j is allowed.) 

We add all these edges to F to obtain F = F U {ei, . . . , et}. Since Cj is the cheapest 
edge connecting Vi to V\Vi, the graph H = {V, F) is a forest. (If some Cj are not uniquely 
determined, cycles may occur. We can avoid these cycles by discarding some of the to 
break the cycles. For the sake of simplicity, we ignore this case in the following analysis.) 
If H still contains illegal components, we set H tohe H and iterate the procedure. 

Lemma 2.3. Let F and F be as described above. Then w{F) < w{F) + 2 • w{F*). 

Proof. We observe that F* contains at least one edge e* connecting Vi to V \ Vi for every 
i e {1, . . . ,b}. If e* = ej for i ^ j, then ^ e* for all k ^ This means that every 
edge occurs at most twice among , . . . , , which implies 

Y,w{e:)<2.w{F*). 

i=l 

By the choice of Cj, we have w{ei) < w(eT). Putting everything together yields 

b b 

w{F) < w{F) + w{ei) < w{F) + ^ u;(e*) < w{F) + 2w{F*). 
i=l 1=1 

□ 

Let us bound the number of phases that are needed in the worst case. 

Lemma 2.4. After at most \j>l/2\ + 1 phases, H does not contain any illegal components. 

Proof. In the beginning, all components of H = (y,F) contain at least gi vertices. If 
gL G L, no phases are needed at all. Thus, we can assume that min(L) > 2gi^. 

To bound the number of phases needed, we will estimate the size of the smallest illegal 
component. Consider any of the smallest illegal components before some phase t, and let 
s be the number of its vertices. In phase t, this component will be connected either to 
another illegal component, which results in a component with a size of at least 2s, or to 
a legal component, which results in a component with a size of at least s + 2gL. (It can 
happen that more than two illegal components are connected to a single component in 
one phase.) 

In either case, except for the first phase, the size of the smallest illegal component 
increases by at least 2gL in every step. Thus, after at most [pl/2J + 1 phases, the size of 
every illegal component is at least {pL + 1)5l- Hence, there are no more illegal components 
since components that consist of at least {pL + l)gL vertices are not illegal. □ 

Eventually, we obtain a forest that consists solely of components whose sizes are in 
(L). We call this forest H = (y,F). Then we proceed as already described above: We 
duplicate each edge, thus obtaining Eulerian components. After that, we take shortcuts 
to obtain an (L)-cycle cover. Finally, we break edges and connect the endpoints of each 
path to obtain an L-cyclc cover. The weight of this L-cycle cover is at most 4 • w{F). 

Overall, we obtain ApxUndir (Algorithm 1) and the following theorem. 

Theorem 2.5. For every L C U, ApxUndir is a factor (4 • [pL + 4)) approximation 
algorithm for Min-L-UCC. Its running-time is 0(ri^ log n). 
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Algorithm 1 ApxUndir. 



Input: undirected complete grapti G = {V,E), \V\ = n; edge weights w : E ^ N satisfy- 
ing the triangle inequality 
Output: an L-cycle cover C^p^ of G if n is L-admissible, _L otherwise 
1: if n ^ (L) then 

2: return ± 

3: run GoeWill using the function described in the text to obtain H = (V, F) 

4: while the size of some connected components of H is not in (L) do 

5: let Ci, . . . , Ca be the connected components of H, where Vi is the vertex set of Cf, 

let Ci, . . . , C5 be its illegal components 
6: let Ci be the lightest edge connecting to V \ Vi 
7: add ei , . . . , to F 
8: while H contains cycles do 
9: remove one Cj to break a cycle 

10: duplicate each edge to obtain a multi-graph consisting of Eulerian components 

11: for all components of the multi-graph do 

12: walk along an Eulerian cycle 

13: take shortcuts to obtain a Hamiltonian cycle 

14: discard edges to obtain a collection of paths, the number of vertices of each of 
which is in L 

15: connect the two cndpoints of every path in order to obtain cycles 
16: the union of all cycles constructed forms C"^p^; return C^^^ 



Proof. Let C* be a minimum-weight (-L)-cycle cover. The weight of F is bounded from 
above by 

w{F) < Q^J + 1) . 2 ■ w{F*) + 2 • w{F*) < {pL + 4) ■ w{C*). 

Combining this with w{C'^p^) < 4 • w{F) yields the approximation ratio. 

Executing GoeWill takes time O(n^logn). All other operations can be implemented 
to run in time 0{n?). □ 

We conclude the analysis of this algorithm by providing an example that shows that 
the approximation ratio of the algorithm depends indeed linearly on p^. To do this, let 

p E N be even. We choose L = {4, 2p + 2, 2p + 4,2p + 6, . . .}. Thus, gi = 2 and pi = p—1. 
Figure 1 shows the graph that we consider and its optimal L-cycle cover. The graph 
consists of 4p + 4 vertices. The weights of the edges, which satisfy the triangle inequality, 
are as follows: 

• Solid, bold edges have a weight of 1. 

• Dashed, bold edges have a weight of 1 -|- e, where £ > can be made arbitrarily 
small. 

• Solid, non-bold edges have a weight of e. 

• Dashed, non-bold edges have a weight of 2e. 

• The weight of the edges not drawn is given by the shortest path between the respec- 
tive vertices. 
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p/2 groups of four vertices 




(a) The graph. (b) The optimal L-cycle cover. 



Figure 1: An example on which ApxUndir achieves only a ratio of roughly pl/2. 

The weight of the optimum L-cycle cover is 2 + {6p + 4)e: The four central vertices 
contribute 2 + Ae, and each of the p remaining 4-cycles contributes 6e. By decreasing e, 
the weight of the optimum L-cycle cover can get arbitrarily close to 2. 

Figure 2 shows what ApxUndir computes. Let us assume that GoeWill returns the 
optimum L-forest shown in Figure 2(a). GoeWill might also return a different forest of 
the same weight: Instead of creating a component of size four, it can take two vertical 
edges of weights s and 2e. However, the resulting L-cycle covers will be equal. 

Starting with the output of GoeWill, ApxUndir chooses greedily the bold edges, 
which have a weight of 1, rather than the two edges of weight 1 + e (Figure 2(b)). From 
the forest thus obtained, it constructs an L-cycle cover (Figure 2(c)). The weight of this 
L-cycle cover is 2(p/2 -|- 1) -|- (4p -|- 2)e. For sufficiently small e, this is approximately 
p + 2 = pl + 3, which is roughly pL /2 + 3/2 times as large as the weight of the optimum 
L-cycle cover. 

Of course, it would be desirable to have an approximation algorithm with a ratio that 
does not depend on L. Directly adapting the technique of Goemans and Williamson [15] 
does not seem to work: The function f{S) = 1 if and only ii \S\ ^ (L) is not proper 
because it violates symmetry. To force it to be symmetric, we can modify it to f'{S) = 1 
if and only if l^l ^ (L) or |y \ 51 ^ (L). But /' does not satisfy disjointness. There are 
generalizations of Goemans and Williamson's approximation technique to larger classes of 
functions [16]. However, it seems that L-cycle covers can hardly be modeled even by these 
more general functions. 

An alternative approach might be to grow a forest greedily without prior execution of 
GoeWill. This works if gL = 1- In this case, GoeWill outputs an empty forest anyway 
since f{S) = for all S CV, and ApxUndir boils down to a greedy algorithm. However, 
if gi > 1, then it is not guaranteed that we obtain a feasible forest at all. 

2.2 Unconditional Inapproximability of Min-L-UCC 

In this section, we provide a lower bound for the approximability of Min-L-UCC as a 
counterpart to the approximation algorithm of the previous section. We show that the 
problem cannot be approximated within a factor of 2 — e. This inapproximability result 
is unconditional, i. e., it does not rely on complexity theoretic assumptions like P 7^ NP. 

The key to the inapproximability of Min-L-UCC are immune sets [26]: An infinite 
set L C N is called an immune set if L does not contain an infinite recursively enumerable 
subset. Such sets exist. One might want to argue that inapproximability results based 
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(a) The output of GoeWill. (b) The final forest. 




(c) The L-cycle cover C"''. 



Figure 2: How ApxUndir computes an L-cycle cover of the graph of Figure 1(a). 

on immune sets are more of a theoretical interest. But our result limits the possibility of 
designing general approximation algorithms for L-cycle covers. To obtain algorithms with 
a ratio better than 2, we have to design algorithms tailored to specific sets L. 

Finite variations of immune sets are again immune sets. Thus for every k eN, there 
exist immune sets L containing no number smaller than k. 

Theorem 2.6. Let e > be arbitrarily small. Let k > 2/e, and let L C {k, k + 1, . . .} be 
an immune set. Then Min-L-UCC cannot be approximated within a factor of 2 — e. 

Proof. Let G„ be an undirected complete graph with n vertices {1, 2, . . . , n}. The weight 
of an edge {i, j'} for i < j is min{j — i,n + i — j}. This means that the vertices are ordered 
along an undirected cycle, and the distance from i to j is the number of edges that have to 
be traversed in order to get from i to j. These edge weights fulfill the triangle inequality. 

For all n L, the optimal L-cycle cover of G„ is a Hamiltonian cycle of weight n. 
Furthermore, the weight of every cycle c that traverses £ < n/2 vertices has a weight of 
at least 2£ — 2: Let i and j be two vertices of c that are farthest apart according to the 
edge lengths of G„. Assume that i < j. By the triangle inequality, the weight of c is 
at least 2 • min{j — i,n + i — j}. Since i < n/2 and by the choice of i and j, we have 
min{j — i,n + i — j} > i — 1, which proves w{c) >2i — 2. 

Consider any approximation algorithm APPROX for Min-L-UCC. We run APPROX on 
G„ for n G N. By outputting the cycle lengths occurring in the L-cycle cover of G„ for all 
n, we obtain an enumeration of a subset S L. Since L is immune, S must be a finite 
set, and s = max(5) exists. Let n > 2s. The L-cyclc cover output for Gn consists of 
cycles whose lengths are at most s < n/2. Since min(L) > k, we also have min(6') > k 
and the L-cycle cover output for Gn consists of at most n/k cycles. Hence, the weight of 
the cycle cover computed by AppROX is at least | • {2k — 2). For n e L, this is a factor 
of2 — |>2 — £ away from the optimum solution. □ 
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Algorithm 2 APxDiR. 

Input: directed complete graph G = (V,£'), \V\ = n; edge weights w : E satisfying 

the triangle inequality 
Output: an L-cycle cover C^p^ of G if n is L-admissible, _L otherwise 
1: if n ^ (L) then 

2: return _L 

3: construct an undirected complete graph Gjj = {V, En) with edge weights wu{{u, v}) = 

w{u, v) + w{v^ u) 
4: run ApxUndir on Gu and wu to obtain 
5: for all cycles cjj of C^f^ do 

6: Cjj corresponds to a cycle of G that can be oriented in two ways; put the orientation 

c that yields less weight into C^p^ 
7: return C^^^ 



Theorem 2.6 is tight since L-cycle covers can be approximated within a factor of 2 by 
L'-cycle covers for every set L' C L with (L') = (L). For finite sets L' , all L'-cycle cover 
problems are NP optimization problems. This means that in principle optimum solutions 
can be found, although this may take exponential time. The following Theorem 2.7 holds 
in particular for finite sets L' . In order to actually get an approximation algorithm for 
Min-L-UCC out of it, we have to solve Min-L'-UCC finite L', which is NP-hard and APX- 
hard. But the proof of Theorem 2.7 shows also that any approximation algorithm for 
Min-L'-UCC for finite sets L' that achieves an approximation ratio of r can be turned into 
an approximation algorithm for the general problem with a ratio of 2r. 

Let mini(G, w) denote the weight of a minimum- weight L-cycle cover of G with edge 
weights w, which have to fulfill the triangle inequality. 

Theorem 2.7. Let L QU be a non-em,pty set, and let L' C L with {L') = (L). Then we 
have mini,/(G, u)) < 2 • miyiL{G,w) for all undirected graphs G with edge weights w that 
satisfy the triangle inequality. 

Proof. Consider an arbitrary L-cycle cover C and any of its cycles c of length A € L. To 
prove the theorem, we show how to obtain an L'-cycle cover C" from C with w{C') < 

2 • w{C). Consider any cycle c of G that has a length of A. If A G L', we simply put c into 
G'. Otherwise, since (L') = (L) 5 L, there exist Ai, . . . , A^ G L' for some fe G N such that 

Aj = A. We remove k edges from c to obtain k paths consisting of Ai, . . . , A^ vertices. 
No additional weight is incurred in this way. Then we connect the respective endpoints of 
each path to obtain k cycles of lengths Ai, . . . , Afc. By the triangle inequality, the weight 
of an edge added to close a cycle is at most the weight of the corresponding path. By 
performing this for every cycle of G, we obtain an L'-cycle cover G' as claimed. □ 

3 Approximability of Min-L-DCC 

3.1 An Approximation Algorithm for Min-X-DCC 

In this section, we present an approximation algorithm for Min-L-DCC. The algorithm 
exploits ApxUndir to achieve an approximation ratio of 0(n). The hidden factor depends 
on PL again. This result matches asymptotically the lower bound of Section 3.2 and shows 
that Min-L-DCC can be approximated at least to some extent. (For instance, without 
the triangle inequality, no polynomial-time algorithm achieves a ratio of 0(exp(n)) for an 
NP-hard L-cycle cover problem unless P = NP.) 
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In order to approximate Min-L-DCC, we reduce the problem to a variant of Min-L- 
UCC, where also 2-cycles are allowed: We obtain a 2-cycle of an undirected graph by 
taking an edge {u, v} twice. Let G = {V, E) be a directed complete graph with n vertices 
and edge weights w : E ^ 'H that fulfill the triangle inequality. The corresponding 
undirected complete graph Gu = {V,Eu) has weights wu '■ Eu N with wu{{u^v}) = 
w{u, v) + w{v, u). 

Let C be any cycle cover of G. The corresponding cycle cover Cu of Gu is given by 
Cu = {{u,v} I {u,v) G C}. Note that we consider Cu as a multiset: If both {u,v) and 
{v,u) are in C, i. e., u and v form a 2-cycle, then {u,v} occurs twice in Cu- Let us bound 
the weight of Cu in terms of the weight of C. 

Lemma 3.1. For every cycle cover C of G, we have wu{Cu) < n ■ w{C). 

Proof. Consider any edge e = (u, v) G C, and let c be the cycle of length A that contains 

e. By the triangle inequality, we have wu{{u,v}) = w{u,v) + w{v,u) < w{c). Let cu be 
the cycle of Cu that corresponds to c. Since c consists of A edges, we obtain wu{cu) < 
A • w{c) < n ■ w{c). Summing over all cycles of C completes the proof. □ 

Our algorithm computes an L'-cycle cover for some finite L' Q L with {L') = (L). 
As in Section 2.1, the weight of the cycle cover computed is compared to an optimum 
(L)-cycle cover rather than an optimum L-cycle cover. Thus, we can again assume that 
already L is a finite set. 

The algorithm ApxUndir, which was designed for undirected graphs, remains to be 
an 0(1) approximation if we allow 2 G L. The numbers pu and gu are defined in the same 
way as in Section 2.1. 

Let C^"^ be the L-cycle cover output by ApxUndir on Gu- We transfer C^'^ into 
an L-cycle cover C^^^ of G. For every cycle cu of C^^, we can orient the corresponding 
directed cycle c in two directions. We take the orientation that yields less weight, thus 
w{C''P'') < wu{C^^)/2. OveraU, we obtain ApxDiR (Algorithm 2), which achieves an 
approximation ratio of 0{n) for every L. 

Theorem 3.2. For every L C V, ApxDiR is a factor (2n • (pu + 4)) approximation 
algorithm for Min-L-DCC. Its running-time is 0(n^ log n). 

Proof. We start by estimating the approximation ratio. Theorem 2.5 yields wu{C^^^) < 
4 • {pl + 4) • wuiC^j), where is an optimal (L)-cycle cover of Gu- Now consider an 
optimum (L)-cycle cover C* of G. Lemma 3.1 yields wu{C^) < n ■ w{C*). Overall, 

w{C^n < \ ■ wuiC^n < 2 • (PL + 4) • wuiClj) < 2 • (PL + 4) • n • w{C*)- 

The running-time is dominated by the time needed to execute GoeWill in ApxUndir, 
which is 0{n^ logn). □ 

3.2 Unconditional Inapproximability of Min-L-DCC 

For undirected graphs, both Max-L-UCC and Min-L-UCC can be approximated efficiently 
to within constant factors. Surprisingly, in case of directed graphs, this holds only for 
the maximization variant of the directed L-cycle cover problem. Min-L-DCC cannot be 
approximated within a factor of o(n) for certain sets L, where n is the number of vertices 
of the input graph. In particular, ApxDir achieves asymptotically optimal approximation 
ratios for Min-L-DCC. 
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One might again want to argue that such an inapproximabihty result is more of the- 
oretical interest. But, similar to the case of Min-L-UCC, this result shows that to find 
approximation algorithms, specific properties of the sets L have to be exploited. A general 
algorithm with a good approximation ratio for all sets L does not exist. Furthermore, as 
we will discuss in Section 3.3, Min-L-DCC seems to be much harder a problem than the 
other three variants, even for more practical sets L. 

Theorem 3.3. Let L C.U be an immune set. Then no approximation algorithm for Min- 
L-DCC achieves an approximation ratio of o{n), where n is the number of vertices of the 
input graph. 

Proof. Let G„ be a directed complete graph with n vertices {1, 2, . . . ,n}. The weight of 
an edge (i, j) is (j — i) mod n. This means that the vertices are ordered along a directed 
cycle, and the distance from i to j is the number of edges that have to be traversed in 
order to get from i to j. These edge weights fulfill the triangle inequality. 

For all n L, the optimal L-cycle cover of G„ is a Hamiltonian cycle of weight n. 
Furthermore, the weight of every cycle that traverses some of G^s vertices has a weight 
of at least n: Let i and j be two traversed vertices with i < j. By the triangle inequality, 
the path from i to j has a weight of at least j — i while the path from j to i has a weight 
of at least i — j + n = (i — j) mod n. 

Consider any approximation algorithm APPROX for Min-L-DCC. We run Approx on 
G„ for n G N. By outputting the cycle lengths occurring in the L-cycle cover of G„ for 
all n = 1, 2, . . ., we obtain an enumeration of a subset S C. L. Since L is immune, S is 
a finite set, and s = max(S') exists. Thus, the L-cycle cover output for G„ consists of at 
least n/s cycles and has a weight of at least n^/s. For n € L, this is a factor of n/s away 
from the optimum solution, where s is a constant that depends only on Approx. Thus, 
no recursive algorithm can achieve an approximation ratio of o(n). □ 

Min-L'-DCC for a finite set L' is an NP optimization problem. Thus, it can be solved, 
although this may take exponential time. Therefore, the following result shows that Min- 
L-DCC can be approximated for all L within a ratio of n/s for arbitrarily large constants 
s, although this may also take exponential time. In this sense. Theorem 3.3 is tight. We 
will first prove a lemma, which we will also use to prove Theorem 4.1. 

Lemma 3.4. For every L C N and every s > 1, there exists a finite set L' C L with 
(L') = (L) and the following property: For every X E L\ L' , there exist Ai, . . . , G L' 
with z < X/s such that Yli=i — ^■ 

Proof. If L is finite, we simply choose L' = L. So we assume that L is infinite. Let again 
ql denote the greatest common divisor of all numbers of L. Let us first describe how to 
proceed if gi € L. After that we deal with the case that gi ^ L. 

Let L' = {A G L I A < m}, and let £ G L'. If m is sufficiently large, then (L') = (L) 
(this follows from the proof of Lemma 2.1 [25, Lem. 3.1] and also implicitly from this 
proof). We will specify i and m, which depend on s, later on. 

Let X E L\L'. Thus, X > m. Let r = mod(A,£). Since A and I arc divisible by 
gi, also r is divisible by gi. Since A ^ L', we have to find Ai, A2, . . . G L' that add up 
to A. We have A = \_X/i\ ■ i + {r/gi) ■ gi- Now we choose Ai = . . . = A|^a/^j = I and 
A|^;^/^j_i_i = ... = A|^;^/£j+r/g^ = gi. What remains is to show that [A/£J +r/gL < X/s. 
To do this, we choose i > s. Since r/g^ is bounded from above by t/gi^ which does not 
depend on A, we obtain [A/£J + r / gi < X/ s for all A > m for some sufficiently large m. 
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The case that gi ^ L remains to be considered. There exist 7ri,...,7rp G L and 
^1, . . . , € Z for some p € N with gi = YTi=i ii'^i- Without loss of generahty, we assume 
that ^1 = mini<j<p4i- We have < since gi ^ L. 

As above, let L' = {A G L | A < m}, and let I G V. Let t = -^i£ ■ XlLi > 0- We 
choose m to be larger than £* . Let \> m, and let r = mod (A — ^* , ^) . Then 



A = 




A 



QL 

^■^ i=l i=l 



We have Pi = — ^i^ > 0: Since .^i < 0, we have > 0. If > 0, then of course 

Pi > 0. If < 0, then — < — ^i, and Pi > follows from r < i. According to the 



deliberations above, we choose Ai 



A 



£. In addition, we set pi of the 



AjS to TTi for 1 < z < p. It remains to be shown that [(A — i*)/£\ + Yli=i Pi ^ ^/^- This 
follows from the fact that pi < i • {^i/gi — Ci) foi" ^ h which is independent of A. Again, 
we choose I > s and m sufficiently large to complete the proof. □ 

Theorem 3.5. For every L and every s > 1, there exists a finite set L' Q L with {L') = (L) 
such that mini/ (G,w) < j ■ mini,(G, w) for all directed graphs G with edge weights w. 

Proof. Let s > 1 and L C D be given. We choose L' Q L as described in the proof of 
Lemma 3.4. In order to prove the theorem, let G be a directed complete graph, and let C 
be an L-cycle cover of minimum weight of G. We show that we can find an L'-cycle cover 
C' withu;(C") < f •u;(C). 

The L'-cycle cover C' contains all cycles of C whose lengths are in L'. Now consider 
any cycle c of length A G L \ L'. According to Lemma 3.4, there exist Ai, . . . , A2 £ L' with 
J2i=i Aj = A and z < X/s. We decompose c into z cycles of length Ai, . . . , A^. By the 
triangle inequality, the weight of each of these new cycles is at most 'w{c). Thus, the total 
weight of all z cycles is at most z ■ w{c) < (X/s) ■ w{c) < (n/s) ■ w{c). By performing this 
for all cycles of C, we obtain an L'-cycle cover C' with mini/(G) < w{C') < (n/s) ■w{C) = 
{n/s) ■ mini(G). □ 



3.3 RemEirks on the Approximability of Min-L-DCC 

It might seem surprising that Min-L-DCC is much harder to approximate than Min-L- 

UCC or the maximization problems Max-L-UCC and Max-L-DCC. In the following, we 
give some reasons why Min-L-DCC is more difficult than the other three L-cycle cover 
problems. In particular, even for "easy" sets L, for which membership testing can be done 
in polynomial time, it seems that Min-L-DCC is much harder to approximate than the 
other three variants. 

Why is minimization harder than maximization? To get a good approximation ratio 
in the case of maximization problems, it suffices to detect a few "good", i. e., heavy edges. 
If we have a decent fraction of the heaviest edges, their total weight is already within a 
constant factor of the weight of an optimal L-cycle cover. In order to form an L-cycle 
cover, we have to connect the heavy edges Tising other edges. These other edges might be 
of little weight, but they do not decrease the weight that we have already obtained from 
the heavy edges. 

Now consider the problem of finding cycle covers of minimum weight. It does not 
suffice to detect a couple of "good", i. e., light edges: Once we have selected a couple of 
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good edges, we might have to connect them with heavy-weight edges. These heavy-weight 
edges can worsen the approximation ratio dramatically. 

Why is Min-L-DCC harder than Min-L-UCC? If we have a cycle in an undirected 
graph whose length is in (L) but not in L (or not in L' but we do not know whether it 
is in L), then we can decompose it into smaller cycles all lengths of which are in L. This 
can be done such that the weight at most doubles (see Section 2). However, this does not 
work for directed cycles as we have seen in the proof of Theorem 3.3: By decomposing a 
long cycle into smaller ones, the weight can increase tremendously. 

Finally, a question that arises naturally is whether we can do better if all allowed cycle 
lengths are known a priori. This can be achieved by restricting ourselves to sets L that 
allow efficient membership testing. Another option is to include the allowed cycle lengths 
in the input, i. e., in addition to an n- vertex graph and edge weights, we are given a subset 
of {2, 3, ... , n} of allowed cycle lengths. 

The cycle cover problem with cycle lengths included in the input contains the ATSP 
as a special case: for an n-vertex graph, we allow only cycles of length n. Any constant 
factor approximation for this variant would thus immediately lead to a constant factor 
approximation for the ATSP. Despite a considerable amount of research devoted to the 
ATSP in the past decades, no such algorithm has been found yet. This is an indication 
that finding a constant factor approximation for the more general problem of computing 
directed cycle covers might be difficult. 

Now consider the restriction to sets L for which {1'^ | A G L} is in P (Min-L-DCC is 
an NP optimization problem for all such L). If we had a factor r approximation algorithm 
for Min-L-DCC for such L, where r is independent of L, we would obtain a c • logra 
approximation algorithm for the ATSP, where c > can be made arbitrarily small: In 
particular, such an algorithm for Min-L-DCC would allow for an r approximation of Min- 
fe-DCC for all /c G N. A close look at the (logn) approximation algorithm for ATSP of 
Frieze et al. [13] shows that an r-approximation for fe-cycle covers would yield an (r -log^ n) 
approximation for the ATSP. We have r ■ logj^n = • logn. Thus, by increasing k, we 
can make c = arbitrarily small. This would improve dramatically over the currently 
best approximation ratio of 0.842 • log2 n [21]. 

4 Properties of Maximum- weight Cycle Covers 

To contrast our results for Min-L-UCC and Min-L-DCC, we show that their maximization 
counterparts Max-L-UCC and Max-L-DCC can, at least in principle, be approximated 
arbitrarily well; their inapproximability is solely due to their APX-hardness and not to 
the difficulties arising from undecidable sets L. In other words, the lower bounds for Min- 
L-UCC and Min-L-DCC presented in this paper are based on the hardness of deciding 
whether certain lengths are in L. The inapproximability of Max-L-UCC and Max-L-DCC 
is based on the difficulty of finding good L-cycle covers rather than testing whether they 
are L-cycle covers. 

Let maxi(G, tt)) be the weight of a maximum-weight L-cycle cover of G with edge 
weights w. The edge weights w do not have to fulfill the triangle inequality. We will 
show that maxi(G, w) can be approximated arbitrarily well by maxLi{G, w) for finite sets 
L' C L with (L') = (L). Thus, any approximation algorithm for Max-L'-UCC or Max- 
L'-DCC for finite sets L' immediately yields an approximation algorithm for general sets 
L with an only negligibly worse approximation ratio. The following theorem for directed 
cycle covers contains the case of undirected graphs as a special case. 
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Theorem 4.1. Let L QD be any non-empty set, and let e > 0. Then there exists a finite 
subset L' O L with {L') = {L) such that niaxL'{G, w) > {1 — e) ■ maxL(G, w) for all graphs 
G with edge weights w. 

Proof. Let e > be given. We choose s > 1 with 1/s < e. According to Lemma 3.4, there 
exists a finite set L' C. L with (L') = (L) with the fohowing property: For all A G L \ L', 
there exist Ai, . . . , G L' for z < A/s < eA that sum up to A. Let us compare maxi/(G) 
and maxi(G). Therefore, let C be an optimum L-cycle cover. We show how to obtain an 
L'-cycle cover C from C. The L'-cycle cover C contains all cycles of C whose lengths 
are in L'. Let us consider any cycle c of length A G L\ L'. There exist Ai, . . . , A^ G L' 
for some z < eX that sum up to A. We break z edges of c to obtain a collection of paths 
of lengths Ai — 1, . . . , A^ — 1. Since we break at most an e fraction of c's edges, we can 
remove these z edges such that at most an e fraction of w{c) is lost. Then we connect the 
respective endpoints of each path to obtain z cycles of lengths Ai, . . . , A^. No weight is 
lost in this way. 

We have lost at most e ■ w{c) of the weight of every cycle c of C, thus max£,/(G) > 
w{C')>{l-e)-w{C) = {l-e)-maxLiG). □ 

5 Concluding Remarks 

First of all, we would like to know whether there is a general upper bound for the approx- 
imability of Min-L-UCC: Does there exist an r (independent of L) such that Min-L-UCC 
can be approximated within a factor of r? We conjecture that such an algorithm exists. 
If such an algorithm works also for the slightly more general problem Min-L-UCC with 
2 G -L (see Section 3.1), then we would obtain a factor rn/2 approximation for Min-L-DCC 
as well. 

While the problem of computing L-cycle cover of minimum weight can be approxi- 
mated efficiently in the case of undirected graphs, the directed variant seems to be much 
harder. We are interested in developing approximation algorithms for Min-L-DCC for 
particular sets L or for certain classes of sets L. For instance, how well can Min-L-DCC 
be approximated if L is a finite set? Are there non-constant lower bounds for the ap- 
proximability of Min-L-DCC, for instance bounds depending on max(L)? Because of the 
similarities between Min-L-DCC and ATSP, an answer to either question would hopefully 
also shed some light on the approximability of the ATSP. 
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